Power - Aware High - Performance Cache Memory
نویسنده
چکیده
iii Acknowledgements iv Abstract CMOS technology scaling in recent decades has enabled a phenomenal performance improvement in microprocessors by allowing designers to increase the level of integration at higher clock frequencies. Unfortunately, technology scaling has also created unprecedented design challenges, including, but not limited to, the devices' leakage current, the interconnect delay and the clock distribution [5,10,11]. These design challenges, accompanied by a number of architectural innovations, often require serious revisits on the conventional architectural techniques that were acceptable in the past. In this paper, we investigate the impact of CMOS technology scaling on the future high-performance cache designs and observe that leakage energy in high-performance L1 cache designs is tightly coupled with the cache's bitline precharging schemes. Based on the investigation, we expect that conventional bitline precharging schemes will soon be unusable due to their prohibitive leakage energy dissipation, as CMOS technology scales. Therefore, we revisit the high-performance cache's bitline precharging schemes and propose novel schemes which are necessary to resolve the conventional schemes' unprecedentedly large and growing leakage energy waste for a wide spectrum of CMOS technology. High-performance caches statically pull up the bitlines in all cache subarrays to optimize cache access latency. Unfortunately, such an architecture results in significant energy waste in nanoscale CMOS implementations due to high leakage and bitline discharge in the unaccessed subarrays. Recent research advocates bitline isolation to control precharging of individual subar-rays using bitline precharge devices. In this paper, we carefully evaluate the energy and performance trade-offs of bitline isolation, and propose techniques to exploit bitline isolation's potential to eliminate discharge and reduce overall energy in level-one caches. v We propose and evaluate an alternative bitline precharging scheme based on bitline isolation. Bitline isolation is a simple and intuitive technique to reduce leakage energy dissipation in unused bitlines by only precharging in-use bitlines. However, we show that applying bitline isolation is not straightforward. Applying bitline isolation exhibits performance and energy trade-offs depending on CMOS technology, thus it must be done with careful architectural considerations in mind. For past and current CMOS technologies, we propose resizable caches, which reduces about 60% to 70% of the bitline discharge without exposing energy and performance implications of bit-line isolation. Gated precharging, our simple hardware-based bitline precharging mechanism based on the subarray reference locality for future CMOS technologies, captures most of the potential of bitline isolation. The technique removes 93% and 96% of the conventional scheme's energy …
منابع مشابه
Reduction in Cache Memory Power Consumption based on Replacement Quantity
Today power consumption is considered to be one of the important issues. Therefore, its reduction plays a considerable role in developing systems. Previous studies have shown that approximately 50% of total power consumption is used in cache memories. There is a direct relationship between power consumption and replacement quantity made in cache. The less the number of replacements is, the less...
متن کاملReduction in Cache Memory Power Consumption based on Replacement Quantity
Today power consumption is considered to be one of the important issues. Therefore, its reduction plays a considerable role in developing systems. Previous studies have shown that approximately 50% of total power consumption is used in cache memories. There is a direct relationship between power consumption and replacement quantity made in cache. The less the number of replacements is, the less...
متن کاملLow Power Aging-Aware On-Chip Memory Structure Design by Duty Cycle Balancing
The degradation of CMOS devices over the lifetime can cause severe threat to the system performance and reliability at deep submicron semiconductor technologies. The negative bias temperature instability (NBTI) is among the most important sources of the aging mechanisms. Applying the traditional guardbanding technique to address the decreased speed of devices is too costly. On-chip memory struc...
متن کاملUsing Silent Writes in Low-Power Traffic-Aware ECC
Using Error Detection Code (EDC) and Error Correction Code (ECC) is a noteworthy way to increase cache memories robustness against soft errors. EDC enables detecting errors in cache memory while ECC is used to correct erroneous cache blocks. ECCs are often costly as they impose considerable area and energy overhead on cache memory. Reducing this overhead has been the subject of many studies. In...
متن کاملDevice-Aware Cache Replacement Algorithm for Heterogeneous Mobile Storage Devices
Abstract. Hard disks, most prevalent mass-storage devices, have high power consumption and high response time for random I/O requests. Recent remarkable technology improvement of flash memory has made it a rising secondary storage device but flash memory still has high cost per bit. Usage of heterogeneous storage devices such as a pair of a hard disk and a flash memory can provide reasonable co...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2003